RAW SEQUENCE LISTING

PATENT APPLICATION US/08/269,118A

DATE: 12/20/96
TIME: 16:22:17

INPUT SET: S14548.raw



This Raw Listing contains the General 
Information Section and up to the first 5 pages. 




SEQUENCE LISTING 

2 

3 (1) GENERAL INFORMATION:

5 (i) APPLICANT: Inouye, Sumiko 

6 Hsu, Mei-Yin 

7 Eagle, Susan 
8 Inouye, Masayori

10 (ii) TITLE OF INVENTION: Prokaryotic Reverse Transcriptase 

12 (iii) NUMBER OF SEQUENCES: 45 
13 

14 (iv) CORRESPONDENCE ADDRESS: 

15 (A) ADDRESSEE: Weiser & Associates 

16 (B) STREET: 230 South Fifteenth Street, Suite 500

17 (C) CITY: Philadelphia 

18 (D) STATE: Pennsylvania 

19 (E) COUNTRY: U.S.A. 

20 (F) ZIP: 19102 
21 

22 (v) COMPUTER READABLE FORM:

23 (A) MEDIUM TYPE: Floppy disk

24 (B) COMPUTER: IBM PC compatible

25 (C) OPERATING SYSTEM: PC-DOS/MS-DOS

26 (D) SOFTWARE: PatentIn Release #1.0, Version #1.25

27

28 (vi) CURRENT APPLICATION DATA:

29 (A) APPLICATION NUMBER: US 08/269,118

30 (B) FILING DATE: 30-JUN-1994 

31 (C) CLASSIFICATION: 
32 

33 (viii) ATTORNEY/AGENT INFORMATION:

34 (A) NAME: Weiser, Gerard J.

35 (B) REGISTRATION NUMBER: 19,763

36 (C) REFERENCE/DOCKET NUMBER: 377.5888P

37

38 (ix) TELECOMMUNICATION INFORMATION:

39 (A) TELEPHONE: 215-875-8383 

40 (B) TELEFAX: 215-875-8394 

41 
42 

43 (2) INFORMATION FOR SEQ ID NO:1:
44 

45 (i) SEQUENCE CHARACTERISTICS: 

46 (A) LENGTH: 2176 base pairs 







(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 640..2094

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1:

TCATCCGCGC GGACACCCCC TCCTACGTGC CCCCCGACGC GGAGAGCGGC GTGGAGACGG 60

TGTACCGCGT TTCCCTGGAT GGTCACCTGG TGGCGGTGGA GTGGGGCCCG CGCACGGGCT 120

CGCCGCGTCA CCAGCGGCTC TGGTTCGACT CGGATGCGGA AGCCCCCGGA GCCTACTTCG 180

CGCGCCTCGA GAAGTTGGCG GCTGACGGCT ACATCGACGC GGCCTCGGCA TTGGTCTAAA 240

CCCTTCAACC ACGGCTCGGC CGCCACGCGC GGCCGGCAGG ACAGGTGCGA CGAACAGACG 300

ACGACGTGCG CTTCACGCGC GAGCAGCCGA GAGAGGTCCG GAGTGCATCA GCCTGAGCGC 360

CTCGAGCGGC GGAGCGGCGT TGCGCCGCTC CGGTTGGAAT GCAGGACACT CTCCGCAAGG 420

TAGCCTGTTC TTGGCTCTCT CCCTCCTAGG CACTACGGCC AGGGTGGGTA GCGGAGCCAA 480

CGACGCCACC GCCGTTTACC CACCCCGGCC GTAGTGCCTA GGAGGGGAGA GCCGGTGAGG 540

CTACCGTGCC CCAGGTAAGA TGGTGGTGCT TTCCCGGCCT CCGTCGACTG CTCGCGCCAT 600

GTCCCGTCTT CCATCGCCGC GCCCGCCCAA GGTGCAGAC ATG ACC GCC AGG CTG 654
Met Thr Ala Arg Leu
1 5

GAC CCG TTC GTC CCC GCA GCT TCG CCG CAG GCC GTG CCC ACG CCC GAG 702
Asp Pro Phe Val Pro Ala Ala Ser Pro Gln Ala Val Pro Thr Pro Glu
10 15 20

CTC ACC GCT CCG TCG TCA GAC GCG GCC GCG AAG CGT GAA GCC CGC CGG 750
Leu Thr Ala Pro Ser Ser Asp Ala Ala Ala Lys Arg Glu Ala Arg Arg
25 30 35

CTC GCG CAC GAA GCG TTG CTC GTC CGC GCG AAG GCC ATC GAC GAA GCG 798
Leu Ala His Glu Ala Leu Leu Val Arg Ala Lys Ala Ile Asp Glu Ala
40 45 50

GGC GGC GCC GAC GAC TGG GTG CAG GCG CAG CTC GTC TCC AAG GGG CTC 846
Gly Gly Ala Asp Asp Trp Val Gln Ala Gln Leu Val Ser Lys Gly Leu
55 60 65

GCG GTC GAG GAC CTG GAC TTC TCC AGC GCC TCC GAG AAG GAC AAG AAG 894


100 Ala Val Glu Asp Leu Asp Phe Ser Ser Ala Ser Glu Lys Asp Lys Lys

101 70 75 80 85
102

103 GCC TGG AAG GAG AAG AAG AAG GCC GAG GCC ACC GAG CGC CGC GCG CTG 942

104 Ala Trp Lys Glu Lys Lys Lys Ala Glu Ala Thr Glu Arg Arg Ala Leu

105 90 95 100

107 AAG CGT CAG GCG CAC GAG GCG TGG AAG GCC ACG CAC GTG GGC CAC CTG 990

108 Lys Arg Gln Ala His Glu Ala Trp Lys Ala Thr His Val Gly His Leu

109 105 110 115

111 GGC GCG GGC GTG CAC TGG GCG GAG GAC CGC CTG GCC GAC GCG TTC GAC 1038 

112 Gly Ala Gly Val His Trp Ala Glu Asp Arg Leu Ala Asp Ala Phe Asp 

113 120 125 130 

115 GTG CCC CAC CGC GAG GAG CGC GCC CGG GCC AAC GGC CTG ACG GAG CTG 1086

116 Val Pro His Arg Glu Glu Arg Ala Arg Ala Asn Gly Leu Thr Glu Leu 

117 135 140 145 
118 

119 GAC TCC GCG GAG GCG CTG GCC AAG GCG CTG GGG CTG AGC GTC TCC AAG 1134

120 Asp Ser Ala Glu Ala Leu Ala Lys Ala Leu Gly Leu Ser Val Ser Lys

121 150 155 160 165
122

123 CTC CGC TGG TTC GCG TTC CAC CGG GAG GTC GAC ACG GCC ACG CAC TAC 1182 

124 Leu Arg Trp Phe Ala Phe His Arg Glu Val Asp Thr Ala Thr His Tyr 

125 170 175 180 
126 

127 GTG AGC TGG ACC ATT CCG AAG CGG GAC GGC AGC AAG CGC ACG ATT ACG 1230 

128 Val Ser Trp Thr Ile Pro Lys Arg Asp Gly Ser Lys Arg Thr Ile Thr

129 185 190 195 
130 

131 TCC CCC AAG CCT GAG CTG AAG GCA GCG CAG CGC TGG GTG CTG TCC AAC 1278 

132 Ser Pro Lys Pro Glu Leu Lys Ala Ala Gln Arg Trp Val Leu Ser Asn

133 200 205 210 
134 

135 GTC GTG GAG CGG CTG CCG GTC CAC GGC GCC GCC CAC GGC TTC GTG GCG 1326 

136 Val Val Glu Arg Leu Pro Val His Gly Ala Ala His Gly Phe Val Ala 

137 215 220 225 
138 

139 GGA CGC TCC ATC CTC ACC AAC GCG CTG GCC CAC CAG GGC GCG GAC GTC 1374 

140 Gly Arg Ser Ile Leu Thr Asn Ala Leu Ala His Gln Gly Ala Asp Val

141 230 235 240 245 
142 

143 GTG GTC AAG GTG GAC CTC AAG GAC TTC TTC CCC TCC GTC ACC TGG CGC 1422 

144 Val Val Lys Val Asp Leu Lys Asp Phe Phe Pro Ser Val Thr Trp Arg 

145 250 255 260 
146 

147 CGG GTG AAG GGC CTG TTG CGC AAG GGC GGC CTG CGG GAG GGC ACG TCC 1470 

148 Arg Val Lys Gly Leu Leu Arg Lys Gly Gly Leu Arg Glu Gly Thr Ser 

149 265 270 275 
150 

151 ACG CTG CTG TCC CTC CTC TCC ACG GAA GCG CCG CGG GAG GCG GTC CAG 1518 

152 Thr Leu Leu Ser Leu Leu Ser Thr Glu Ala Pro Arg Glu Ala Val Gln

153 280 285 290
154

155 TTC CGC GGC AAG CTC CTG CAC GTC GCC AAG GGC     CGC GCC CTG CCC 1566

156 Phe Arg Gly Lys Leu Leu His Val Ala Lys Gly

157 295 300 305
158

159 CAG GGC GCC CCC ACG TCG CCC GGC ATC ACC AAC 1614

160 Gln Gly Ala Pro Thr Ser Pro Gly Ile Thr Asn

161 310 315 320 325
162

163 CTC GAC AAG CGG CTG TCC GCC CTC GCG AAG CGG 1662

164 Leu Asp Lys Arg Leu Ser Ala Leu Ala Lys Arg

165 330 335 340
166

167 ACG CGC TAC GCG GAC GAC CTG ACC TTC TCC TGG 1710

168 Thr Arg Tyr Ala Asp Asp Leu Thr Phe Ser Trp

169 345 350 355
170

171 CCC AAG CCG CGG CGG ACG CAG CGT CCC CCC GTC 1758

172 Pro Lys Pro Arg Arg Thr Gln Arg Pro Pro Val

173 360 365 370
174

175 CGC GTC CAG GAA GTG GTG GAG GCG GAG GGC TTC 1806

176 Arg Val Gln Glu Val Val Glu Ala Glu Gly Phe

177 375 380 385
178

179 AAG ACG CGC GTC GCC CGC AAG GGC ACG CGG CAG 1854

180 Lys Thr Arg Val Ala Arg Lys Gly Thr Arg Gln

181 390 395 400 405
182

183 GTC GTG AAT GCG GCG GGC AAG GAC GCG CCC GCG 1902

184 Val Val Asn Ala Ala Gly Lys Asp Ala Pro Ala

185 410 415 420
186

187 GAC GTC GTC CGC CAG CTC CGC GCC GCC ATC CAC 1950

188 Asp Val Val Arg Gln Leu Arg Ala Ala Ile His

189 425 430 435
190

191 AAG CCG GGC CGC GAG GGC GAG TCG CTC GAG CAG 1998

192 Lys Pro Gly Arg Glu Gly Glu Ser Leu Glu Gln

193 440 445 450
194

195 GCC TTC ATC CAC ATG ACG GAC CCG GCC AAG GGC 2046

196 Ala Phe Ile His Met Thr Asp Pro Ala Lys Gly

197 455 460 465
198

199 CAG CTC ACG GAG CTC GAG TCC ACG GCG AGC GCC 2094

200 Gln Leu Thr Glu Leu Glu Ser Thr Ala Ser Ala

201 470 475 480 485
202

203 TGACGCTCAG CGCGCGTCCG TCGCCGACGT GCCGCGCGCC 2154
204

205 CTCCGTCAGC CGGCGCGGGT AC 2176




206 
207 

208 (2) INFORMATION FOR SEQ ID NO: 2: 
209 

210 (i) SEQUENCE CHARACTERISTICS: 

211 (A) LENGTH: 263 amino acids 

212 (B) TYPE: amino acid 

213 (D) TOPOLOGY: linear 
214 

215 (ii) MOLECULE TYPE: protein 

216 
217 
218 

219 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

220 

221 Val Lys Leu Lys Pro Gly Met Asp Gly Pro Lys Val Lys Gln Trp Pro

222 1 5 10 15
223

224 Leu Thr Glu Glu Lys Ile Lys Ala Leu Val Glu Ile Cys Thr Glu Met

225 20 25 30
226

227 Glu Lys Glu Gly Lys Ile Ser Lys Ile Gly Pro Glu Asn Pro Tyr Asn

228 35 40 45
229

230 Thr Pro Val Phe Ala Ile Lys Lys Lys Asp Ser Thr Lys Trp Arg Lys

231 50 55 60
232

233 Leu Val Asp Phe Arg Glu Leu Asn Lys Arg Thr Gln Asp Phe Trp Glu

234 65 70 75 80
235

236 Val Gln Leu Gly Ile Pro His Pro Ala Gly Leu Lys Lys Lys Lys Ser

237 85 90 95
238

239 Val Thr Val Leu Asp Val Gly Asp Ala Tyr Phe Ser Val Pro Leu Asp

240 100 105 110

241

242 Glu Asp Phe Arg Lys Tyr Thr Ala Phe Thr Ile Pro Ser Ile Asn Asn

243 115 120 125
244

245 Glu Thr Pro Gly Ile Arg Tyr Gln Tyr Asn Val Leu Pro Gln Gly Trp

246 130 135 140
247

248 Lys Gly Ser Pro Ala Ile Phe Gln Ser Ser Met Thr Lys Ile Leu Glu

249 145 150 155 160
250

251 Pro Phe Lys Lys Gln Asn Pro Asp Ile Val Ile Tyr Gln Tyr Met Asp

252 165 170 175
253

254 Asp Leu Tyr Val Gly Ser Asp Leu Glu Ile Gly Gln His Arg Thr Lys

255 180 185 190
256

257 Ile Glu Glu Leu Arg Gln His Leu Leu Arg Trp Gly Leu Thr Thr Pro

258 195 200 205 



